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Abstract 

Let P be a set of n points in an axis-parallel rectangle B in the plane. We 
present an 0(na(n) log^ n)-time algorithm to preprocess P into a data structure of 
size O {na{n) log"^ n), such that, given a query point q, we can find, in O(log^n) time, 
the largest-area axis-parallel rectangle that is contained in B, contains q, and its inte- 
rior contains no point of P. This is a significant improvement over the previous solution 
of Augustine et al. [4], which uses slightly superquadratic preprocessing and storage. 
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1 Introduction 



Let P be a set of n points in a fixed axis-parallel rectangle B in the plane. A P-empty rect- 
angle (or just an empty rectangle for short) is any axis-parallel rectangle that is contained in 
B and its interior does not contain any point of P. We consider the problem of preprocessing 
P into a data structure so that, given a query point q, we can efficiently find the largest-area 
P-empty rectangle containing q. This problem arises in electronic design automation, in the 
context of the design and verification of physical layouts of integrated circuits (see, e.g., [2T| 
Chapter 9]). 

The largest-area P-empty rectangle containing g is a maximal empty rectangle, namely, 
it is a P-empty rectangle not contained in any other P-empty rectangle. Each side of a 
maximal empty rectangle abuts a point of P or an edge of B. See Figure [T]for an illustration. 
Maximal empty rectangles arise in the enumeration of "maximal white rectangles" in image 
segmentation [5]. 
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Figure 1: A maximal P-empty rectangle containing q. 

The problem considered here can be formulated in more general settings by considering 
other classes of shapes that have to contain the query point (and be P-empty), and other 
kinds of bounding regions. To the best of our knowledge this problem was first introduced 
by Augustine et al. [1], who studied the case where the regions containing the query point 
are disks and the case where these regions are axis-parallel rectangles. For the case of disks 
they give a data structure that requires O(n^) space, 0(n^log?7,) preprocessing time, and 
can answer a query in 0(log^?7,) time. For the case of rectangles (the one also considered 
here) they give a data structure whose storage and preprocessing time are both 0{n^logn), 
and the query time is O(logn). 

Our result. We significantly improve the result of Augustine et al. for the case of axis- 
parallel rectangles, in terms of storage and preprocessing costs. Specifically, we present a 
data structure that requires 0{na{n)\og^ n) space and can be used to find the largest-area 
P-empty rectangle containing a query point q in 0(log^?7,) time. The structure can be 
constructed in 0{na{n) log n) time. Here a{n) is the slowly increasing inverse Ackermann 
function. 
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In a nutshell, our algorithm computes all the maximal P-empty rectangles and prepro- 
cesses them into a data structure which is then searched with the query point. A major 
problem that one faces is that the number of maximal P-empty rectangles can be quadratic 
in n (see, e.g.. Figure [5]), so we cannot afford to compute them explicitly (this issue was 
ignored in 

One of the main ingredients of our solution developed to overcome this difficulty and 
significant in itself, is a technique for handling partial inverse Monge matrices (see [T5| [TB] ) . 
Specifically, we observe that the areas of certain subsets of maximal empty rectangles can 
be arranged in a matrix satisfying the inverse Monge property; see below for details. The 
structure of standard Monge (and inverse Monge) matrices supports linear or near-linear 
algorithms (in the number of rows and columns of the matrix) for finding all the row maxima 
(or minima) in such a matrix. These algorithms usually avoid an explicit representation of 
the matrix, and instead compute the row maxima or minima by accessing only a small 
number of entries, each of which can be retrieved in 0(1) time. For full matrices only a 
linear number of entries is needed [1], and for certain kinds of structured partial matrices an 
almost linear number of entries suffices [151 CB] . 

We extend these basic techniques, and develop a data structure that supports efficient 
maxima queries in certain submatrices of a partial inverse Monge matrix. In a typical query 
of this kind we specify a row and a range of columns, and seek the maximum element of 
this row within this range. In another kind of queries, we specify a prefix of the rows and a 
prefix of the columns, and seek the maximum in the submatrix formed by these prefixes. A 
variation of our data structure has already been applied in a recent maximum flow algorithm 
for planar graphs [8] . Our data structure can be extended for general submatrix queries and 
is likely to find additional applications. 

Related work. An easier problem that has been studied more extensively is that of finding 
the largest-area P-empty axis-parallel rectangle contained in B. Notice that the largest 
P-empty square is easier to compute, because its center is a Voronoi vertex in the L^o- 
Voronoi diagram of P (and of the edges of B), which can be found in O(nlogn) time 
[121 [U] • There have been several studies on finding the largest-area bounded maximal empty 
rectangle [31 [101 [IS]; the fastest algorithm to date, by Aggarwal and Suri [2], takes 0(n log^ n) 
time and 0{n) space. Nandy et al. [TH] show how to find the largest parallel empty 

rectangle avoiding a set of polygonal obstacles, within the same time bounds. Boland and 
Urrutia [7| present an algorithm for finding the largest-area axis-parallel rectangle inside an 
ra-sided simple polygon in 0(n log n) time. Chaudhuri et al. [9] give an algorithm to find the 
largest-area P-empty rectangle, with no restriction on its orientation, in 0{ri^) time. 

The variant studied in this paper, of finding the largest P-empty rectangle containing 
a query point is newer and, as already mentioned, the only previous study of this problem 
that we are aware of is by Augustine et al. [1]. 
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2 Preliminaries 



We assume that the points of P are in general position, so that (i) no two points have the 
same x-coordinate or the same y-coordinate, and (ii) all the maximal P-empty rectangles 
have distinct 

One of the auxiliary structures that we use is a two-dimensional segment tree, which 
stores certain subsets of P-maximal empty rectangles. Here is a brief review of the structure, 
provided for the sake of completeness. Let be a set of axis-parallel rectangles in 
the plane. We first construct a standard segment tree S |T3] on the x-projections of the 
rectangles in Ai. This is a balanced binary search tree whose leaves correspond to the 
intervals between the endpoints of the x-projections of the rectangles. The span of a node 
V is the minimal interval containing all intervals corresponding to the leaves of its subtree. 
We store a rectangle R at each node v such that the x-projection of R contains the span 
of V but does not contain the span of the parent of v. The tree has 0{N) nodes, each 
rectangle is stored at O(logA^) nodes, and the size of the structure is thus O(A^logA^). All 
the rectangles containing a query point q must be stored at the nodes on the search path of 
the x-coordinate of q in the tree. 

For each node tt of 5 we take the set M.^ of rectangles stored at -u, and construct a 
secondary segment tree Su, storing the ?/-projections of the rectangles of M.u- The total size 
and the preprocessing time of the resulting two-dimensional segment tree is 0(A^log^ A^). 
We can retrieve all rectangles containing a query point q by traversing the search path tt of 
(the x-coordinate of) q in the primary tree, and then by traversing the search paths of (the 
?/-coordinate of) q in each of the secondary trees associated with the nodes along tt. The 
rectangles stored at the secondary nodes along these paths are exactly those that contain q. 
If we store at each secondary node only the rectangle of largest area among those assigned 
to that node, we can easily find the largest-area rectangle of M. containing a query point, in 
time 0(log A^). Storing only one rectangle at each secondary node reduces the size of the 
segment tree to 0(A^log A^), but the preprocessing time remains 0(A^log^ A^). 

This simple-minded solution will be efficient only when the size of A4. is linear or nearly 
linear in n. Unfortunately, as already noted, in general the number of maximal empty 
rectangles can be quadratic in the input size, so for most of them we will need an additional, 
implicit representation. The structure that we will use for this purpose is a partial inverse 
Monge matrix, so we first provide a brief background on Monge matrices. 

Monge matrices: A brief review. A matrix M is a Monge matrix (resp., an inverse 
Monge matrix) if for every pair of rows i < j and every pair of columns k < £ we have 
Mik + Mjg < Ma + Mjk (resp., Mik + Mjg > Ma + Mjk). A matrix is totally monotone if, for 
every pair of rows i < j and every pair of columns k < i, Mik < Ma implies Mjk < Mj£. It 
is easy to verify that an inverse Monge matrix is totally monotone. Aggarwal et al. [1] gave 
an algorithm for finding all row maxima in a totally monotone m x n matrix in 0{m + n) 
time. A partial totally monotone matrix is a matrix some of whose entries are undefined, 
but it satisfies the total monotonicity condition for every pair of rows i < j and every pair 
of columns k < I ioi which all four entries Mj^, Mn, Mjk-, and Mji, are defined. Klawe 
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and Kleitman [16] give an 0{na{m) + m)-time algorithm for finding all row maxima of 
staircase totally monotone matrices. These are partial totally monotone matrices in which 
the defined part of each row is contiguous, starting from the first column, and the defined 
part of each row is not smaller than the defined part of the preceding row. Klawe [15] also 
present an 0(nloglogm + m)-time algorithm for finding all row maxima in skyline totally 
monotone matrices, where the defined part of each column is contiguous starting from the 
bottommost row. All these algorithms for partially totally monotone matrices can find row 
minima instead of row maxima within the same time bound. 

3 The Data Structure 

Let P be a set of n points inside an axis-parallel rectangular region B in the plane. Recall that 
our goal is to preprocess P into a data structure, so that, given a query point g G S, we can 
efficiently find the largest-area axis-parallel P-empty rectangle containing q and contained 
in B. 

3.1 Maximal rectangles with edges on the boundary of B 

Let Ct, Cb, eg, and e,. be the top, bottom, left, and right edges of B, respectively. We classify 
the maximal P-empty rectangles within B according to the number of their edges that touch 
the edges of B. We show that there are only 0{n) maximal P-empty rectangles with at least 
one edge on dB. We precompute these rectangles and store them in a two-dimensional 
segment tree S, as described above. At query time we find the rectangle of largest area 
among those special "anchored" rectangles that contain the query point q, by searching with 
q in S. (The segment tree S will also store additional rectangles that will arise in later steps 
of the construction; see below for details.) 

Here is the classification and analysis of maximal P-empty rectangles R with at least one 
edge on dB. 

(i) Three edges of R lie on dB. It is easy to verify that there are only four such 
rectangles, one for each triple of edges of B. 

(ii) Two adjacent edges of R lie on dB. Suppose, without loss of generality, that the 
top and right edges of R lie on Cj and e^, respectively. The other two edges of R must be 
supported by a pair of maxima of P (that is, points p G P for which no other point q E P 
satisfies Xq > Xp and yq > yp), consecutive in the sorted order of the maxima (by their x- or 
^-coordinates). Since there are 0{n) pairs of consecutive maxima, the number of anchored 
rectangles of this kind is also 0{n). See Figure [2l The other three situations are handled in 
a fully symmetric manner. 

(iii) Two opposite edges of R lie on two opposite edges of B. Suppose, without 
loss of generality, that the left and right edges of R lie on eg and e,., respectively. In this 
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Figure 2: Maximal P-empty rectangles with two adjacent edges on dB. 



case the top and bottom edges of R must be supported by two points of P, consecutive in 
their y-order. Clearly, there are 0{n) such pairs, and thus also 0{n) rectangles of this kind. 
Again, handling the top and bottom edges of B is done in a fully symmetric manner. See 
Figure |3l 
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Figure 3: Maximal P-empty rectangles with two opposite edges on dB. 



(iv) One edge of R lies on dB. Suppose, without loss of generality, that the right edge 
of R lies on e^. Then the three other sides of R must be supported by points of P. For 
each point p & P there is a unique maximal P-empty rectangle whose right edge lies on Cr 
and whose left edge passes through p. This rectangle is obtained by connecting p to by 
a horizontal segment h and then by translating h upwards and downwards until it first hits 
two respective points of P, or reaches dB. (In the latter situations we obtain rectangles of 
the preceding types.) Hence there are 0{n) rectangles of this kind too. 

It is easy to compute all the maximal P-empty anchored rectangles of the above four 
classes, in overall 0(?7,logn) time: Computing rectangles of type (i) and (iii) only requires 
sorting the points by their x or y coordinates. Computing rectangles of type (ii) (of the 
specific kind depicted in Figure [2]) requires computing the list of maximal points. This can 
be done by scanning the points from right to left maintaining the highest point seen so far. A 
point p is maximal if and only if it is higher than the previous highest point. We can compute 
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Figure 4: A maximal P-empty rectangle with one edge on dB. 



the rectangles of type (iv) (of the specific kind depicted in Figure H]) also by traversing the 
points from right to left while maintaining the points already traversed, sorted by their y- 
coordinates, in a balanced search tree. When we process a point p then its successor and 
predecessor (if they exist) in the tree define the top and bottom edges of the rectangle of 
type (iv) whose left edge passes through p. 

We collect these rectangles and store them in our two-dimensional segment tree S. Given 
a query point we can find the rectangle of largest area containing q among these rectangles 
by searching in S", as explained above, in 0(log^ n) time. 

3.2 Maximal empty rectangles supported by four points of P 

In the remainder of the paper we are concerned only with maximal P-empty rectangles 
supported by four points of P, one on each side of the rectangle. We refer to such rectangles 
as hounded P-empty rectangles. We note that the number of such rectangles can be Q{v?) in 
the worst case; see Figure [5] for an illustration of the lower bound. The upper bound follows 
by observing that there is at most one maximal P-empty rectangle whose top and bottom 
edges pass through two respective specific points of P. (To see this, take the rectangle 
having these points as a pair of opposite vertices and, assuming it to be P-empty, expand it 
to the left and to the right until its left and right edges hit two additional respective points.) 
Handling these (potentially quadratically many) rectangles has to be done implicitly, in a 
manner that we now proceed to describe. 

We store the points of P in a two-dimensional range tree (see, e.g., [I3]). The points 
are stored at the leaves of the primary tree T in their left-to-right order. For a node u of 
T, we denote by P„ the subset of the points stored at the leaves of the subtree rooted at 
u. We associate with each internal node u oiT a, vertical splitter which is a vertical line 
separating the points stored at the left subtree of u from those stored at the right subtree. 
These splitters induce a hierarchical binary decomposition of the plane into vertical strips. 
The strip CTroot associated with the root is the entire plane, and the strip of a node u is 
the portion of the strip of the parent p{u) of u which is delimited by tp{u) and contains P^. 

With each node w in T we associate a secondary tree containing the points of P„ in a 



6 



B 





















o 

o 














o 

o 


o 

c 



























c 














o 









Figure 5: A set P of n points with 0(n^) maximal P-empty rectangles. 



bottom-to-top order. For a node v oiT^^ we denote by the points stored at the leaves of 
the subtree rooted at v. We associate with each internal node v of T„ a horizontal splitter 
ly, which is a horizontal line separating the points stored at the left subtree of v from those 
stored at the right subtree. These splitters induce a hierarchical binary decomposition of the 
strip o"„ into rectangles. The rectangle associated with the root of is the entire vertical 
strip a^i and the rectangle of a node v is the portion of the rectangle of the parent p{v) 
of V which is delimited by and contains P„. See Figure [61 
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Figure 6: The rectangle associated with a secondary node v, with its splitters and origin. 



In this way, the range tree defines a hierarchical subdivision of the plane, so that each 
secondary node v is associated with a rectangular region By of the subdivision. If v is not a 
leaf then it is associated with a horizontal splitter £y. If the primary node u associated with 
the secondary tree of v is also not a leaf then v is also associated with a vertical splitter £y. 
The vertical segment and the horizontal segment iy meet at a point Oy inside By, which 
we refer to as the origin of v. 

A query point q defines a search path tt^ in T and a search path in each secondary tree 
Tu of a primary node u on tt^. We refer to the nodes on these O(logn) paths as constituting 
the search set of q, which therefore consists of 0(log^?7,) secondary nodes. 

Let P be a bounded maximal P-empty rectangle containing q supported by four points 
Pt, Pb, Pe, and pr of P, lying respectively on the top, bottom, left, and right edges of R. Let 
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u be the lowest common ancestor of pi and in the primary tree, and let v be the lowest 
common ancestor of pt and in (clearly, both pt and p^ belong to T„). By construction, 
R is contained in and contains both q and o^. See Figure [71 Note that both v and u are 
internal nodes (each being an lowest common ancestor of two leaves) so is indeed defined. 
Furthermore, one can easily verify that v is in the search set of q. 
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Figure 7: A bounded maximal P-empty rectangle of the subproblem at v. 

In the following we consider only secondary nodes v which are not leaves, and are asso- 
ciated with primary nodes u which are not leaves. 

We define the subproblem at a secondary node v (of the above kind) as the problem of 
finding the largest-area bounded maximal P-empty rectangle containing q and Oy which lies 
in the interior of P„. It follows that if we solve each subproblem at each secondary node v in 
the search set of q, and take the rectangle of largest area among those subproblem outputs, 
we get the largest-area bounded maximal P-empty rectangle containing q. 

In the remainder of this section we focus on the solution of a single subproblem at a 
node V oi a. secondary tree in the search set of q. We focus only on the points in and 
for convenience we extend P„ to the entire plane and we move to the origin. The line 
iu becomes the ?/-axis, and the line becomes the x-axis. Put n„ = |P„|. We classify the 
bounded maximal P-empty rectangles contained in P„ and containing the origin according 
to the quadrants containing the four points associated with them, namely, those lying on 
their boundary, and find the largest-area rectangle containing q in each class separately. 

(i) Three defining points in a halfplane. The easy cases are when one of the four 
halfplanes defined by the x-axis or the y-axis (originally iu and i^) contains three of the 
defining points. Suppose for specificity that this is the halfplane to the left of the y-axis; 
the other four cases are treated in a fully symmetric manner. See Figure El Consider the 
subset Pe of points to the left of the y-axis. For each point p of Pe there is (at most) a single 
rectangle in this family such that p is its left defining point. Similarly to the analysis in case 
(iv) of Section 13.11 we obtain this rectangle by connecting p to the y-axis by a horizontal 
segment, and shifting this segment up and down until it hits two other respective points of 
Pi. Now we have a rectangle bounded by three points of Pg whose right edge is anchored to 
the y-axis. Extend this rectangle to the right until its right edge hits a point to the right 
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of the y-axis, to obtain the unique rectangle of this type with p on its left edge. (Here, and 
in the other cases discussed below, we assume that all the relevant points of P do exist; 
otherwise, the rectangle that we construct is not fully contained in the interior of B^. This 
would be the case, for example, if the shift of the above rectangle to the right of the y-axis 
does not encounter any point of P^.) Clearly, there are O(n^) bounded maximal empty 
rectangles of this type. 
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Figure 8: A bounded maximal P-empty rectangle with three defining points to the left of 
the |/-axis. 

We can find the part of each such rectangle which is to the left of the y-axis, by sweeping 

with a vertical line from the y-axis to the left maintaining the points already seen in a 
balanced search tree, exactly as we computed the empty rectangles of type (iv) in Section 
13.11 To find the part of each rectangle which is to the right of the y-axis we store the points 
to the right of the y-axis, sorted by their y-coordinates, in a search tree S^. With each node 
of Ej. we store the leftmost point stored in its subtree. We can identify the right edge of 
each rectangle R by using to find, in logarithmic time, the leftmost point to the right of 
the y-axis between the top and the bottom edges of R. 

Overall we can find all rectangles of this type associated with v in 0{ny\ogny) time. 
Summing this cost over all secondary nodes v, we obtain a total of 0{n log^ n) such rectangles, 
which can be constructed in O(nlog^n) overall time. 

We add all these rectangles to the global segment tree S. The size of the expanded tree 
S remains O(nlogn) since it still suffices to store only the largest-area rectangle among 
all rectangles associated with each secondary node. The preprocessing time increases to 
0(?7,log n) since each of the 0(nlog n) rectangles is mapped to 0(log n) secondary nodes 
of S, and for each rectangle R and a node u to which R is mapped, we need to check whether 
R is the largest rectangle mapped to u. A query in S still takes 0(log^?T,) time. 

The remaining cases involve bounded maximal P-empty rectangles R such that each of 
the four halfplanes defined by the y-axis or the x-axis contains exactly two defining points 
of R. This can happen in two situations: either there exist two opposite quadrants, each 
containing two defining points of R, or each quadrant contains exactly one defining point of 
R. 
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(ii) One defining point in each quadrant. The situation in which each quadrant con- 
tains exactly one defining point of R is also easy to handle, because again there are only 
0{ny) bounded maximal P-empty rectangles of this type in B^. To see this, consider, with- 
out loss of generality, the case where the first quadrant contains the right defining point, 
Pr, the second quadrant contains the top defining point, pt, the third quadrant contains the 
left defining point, pi, and the fourth quadrant contains the bottom defining point, pi,. See 
Figure M (There is one other situation, in which the top defining point lies in the first 
quadrant, the right point in the fourth quadrant, the bottom point in the third, and the left 
point in the second; this case is handled in a fully symmetric manner.) 
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Figure 9: A bounded maximal P-empty rectangle with one defining point in each quadrant. 

We claim that pr can be the right defining point of at most one such rectangle. Indeed, if 
Pr is the right defining point of such a rectangle R then pi, is the first point we hit when we 
sweep downwards a horizontal line segment connecting p^ to the y-axis (assuming the sweep 
reaches below the x-axis; otherwise Pr cannot be the right defining point of any rectangle of 
the current type). Similarly, the point pe is the first point that we hit when we sweep to the 
left a vertical line segment connecting p^ to the x-axis, pt is the first point we hit when we 
sweep upwards a horizontal line segment connecting pi to the y-axis, and finally Pr is the first 
point we hit when we sweep a vertical line segment connecting pt to the x-axis. As noted, if 
any of the points we hit during these sweeps is not in the correct quadrant, or the last sweep 
does not hit pr (e.g., because the point pt is lower than pr), or one of the sweeps does not 
hit any point before hitting dB^, then pr is not the right defining point of any rectangle of 
this type. 

We compute these 0{ny) bounded maximal P-empty rectangles using four balanced 
search trees. As for rectangles of type (i) we maintain the points to the right of the y- 
axis in a balanced search tree S^, sorted by their y coordinates, storing with each node the 
leftmost point in its subtree. Similarly, we maintain the points below the x-axis in a balanced 
search tree sorted by their x coordinates, storing with each node the topmost point in its 
subtree. We maintain the points to the left of the y-axis and the points above the x-axis in 
symmetric search trees and Sf, respectively. We can find each rectangle in this family by 
four queries, starting with each point Pr in the first quadrant, first in T,}, to identify pb, then 
in to find p£, in to find pt, and finally in to ensure that we get back to Pr- 
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Summing over all secondary nodes v, we have 0{nlog^ n) such rectangles, which we can 
construct in 0{n log^ n) overall time. We add them too to the global segment tree S", without 
changing the asymptotic bounds on its performance parameters, as discussed earlier. 

3.3 Two defining points in the first and third quadrants 

The hardest case is where, say, each of the first and third quadrants contains two defining 
points of R. (The case where each of the second and fourth quadrants contains two defining 
points is handled symmetrically.) The defining points in the first (resp., third) quadrant are 
consecutive minimal (resp., maximal) points of the subset of P„ in that quadrant. There are 
0{n^) such pairs. Denote the sequence of maximal points of the third quadrant by and 
the sequence of minimal points of the first quadrant by F, both sorted from left to right (or, 
equivalently, from top to bottom). 

Consider a consecutive pair (a, 6) in E (with a to the left and above h). Let Mi be 
the unique maximal P-empty rectangle whose right edge is anchored at the y-axis, its left 
edge passes through a, its bottom edge passes through 6, and its top edge passes through 
some point c (in the second quadrant); it is possible that c does not exist, in which case 
some minor modifications (actually, simplifications) need to be applied to the forthcoming 
analysis, which we do not spell out. 

Let M2 be the unique maximal empty rectangle whose top edge is anchored at the x-axis, 
its left edge passes through a, its bottom edge passes through 6, and its right edge passes 
through some point d (in the fourth quadrant; again, we ignore the case where d does not 
exist). See Figure [TUl Our maximal empty rectangle cannot extend higher that c, nor can 
it extend to the right of d. Hence its two other defining points must be a pair {w^ z) of 
consecutive elements of F, both lying to the left of d and below c. The minimal points which 
satisfy these constraints form a contiguous subsequence of F. 

That is, for each consecutive pair p = (a, b) of points of E we have a contiguous "interval" 
Ip C F, so that any consecutive pair n = {w,z) oi points in Ip defines with p a maximal 
empty rectangle which contains the origin, and these are the only pairs which can define 
with p such a rectangle. (Note that we can ignore the "extreme" rectangles defined by a, b, c, 
and the highest point of Ip, or by a, b, d, and the lowest point of Ip, since these rectangles 
have three of their defining points in a common halfplane defined by the x-axis or by the 
y-axis, and have therefore already been treated.) 

To answer queries with respect to these rectangles, we process the data as follows. We 
compute the chain E of maximal points in the third quadrant and the chain F of minimal 
points in the first quadrant, ordered as above. This is done in 0{ni,\ogny) time in the same 
way as we computed the chain of maximal points of P in Section ISTTl For each pair p = [a, b) 
of consecutive points in E we compute the corresponding delimiting points c (in the second 
quadrant) and d (in the fourth quadrant). Formally, c is the lowest point in the second 
quadrant which lies to the right of o, and d is the leftmost point in the fourth quadrant 
which lies above b. We then use c and d to "carve out" the interval Ip of F, consisting of 
those points that lie below c and to the left of d. We can find c by a binary search in the 
chain of y-minimal and x-maximal points in the second quadrant, and find d by a binary 
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Figure 10: The structure of maximal empty rectangles with two defining points in each of 
the first and third quadrants. 

search in the chain of x-minimal and ?/- maximal points in the fourth quadrant. These chains 
can be computed in the same way as in the construction of E and F. Once we have the 
chains we can find, for each consecutive pair p = [a, b) in E, the corresponding entities c, d, 
and Ip, in Oilogn^) time. 

We next define a matrix A as follows. Each row of A corresponds to a pair p of consecutive 
points in E and each column of A corresponds to a pair n of consecutive points in F. If at 
least one point of tt is not in Ip then the value of is undefined. Otherwise, it is equal to 
the area of the (maximal empty) rectangle defined by p and n. By the preceding analysis, 
the defined entries in each row form a contiguous subsequence of columns. It is easy to verify 
that if p2 follows (i.e., lies more to the right and below) pi on E then the left (resp., right) 
endpoint of Ip^ cannot be to the right of the left (resp., right) endpoint of /p^; See Figure [TTl 
It follows that in each column of A the defined entries also form a contiguous subsequence 
of rows. We refer to such partially defined matrix A in which the defined part of each row 
and of each column is consecutive as a double staircase matrix. 

The following simple lemma plays a crucial role in our analysis. 

Lemma 3.1. Let xi, X2, Hi, 1/2 be four points in the plane, so that Xi and X2 lie in the first 
quadrant, yi andy2 lie in the third quadrant, Xi lies northwest to X2, andyi lies northwest to 
y2- For any point w in the third quadrant and any point z in the first quadrant, let R{w, z) 
denote the rectangle having w and z as opposite corners, and let A{w, z) denote the area of 
R{w,z). Then we have 

A{yi, xi) + A{y2, X2) > A{yi, X2) + A{y2, xi). (1) 
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Figure 11: The structure of the defined portion of A. 



Proof. The situation is depicted in Figure [T2j In the notation of the figure we have 

A{yi, xi) + A{y2, X2) = A{yi, X2) + A{y2, xi) + Ai + A2, 
where Ai and A2 are the areas of the two shaded rectangles. 
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Figure 12: The inverse Monge property of maximal rectangles. 



Lemma [HIT] asserts that if Ap^^,^, ^p27r2) ^pi-n2i Ap^^^,^, for pi < p2 and tti < 712, are all 
defined then 

or, equivalently, 

Hence A satisfies the inverse Monge property, with respect to its defined entries, so it is a 
partial inverse Monge (and thus also totally monotone) matrix. 

3.3.1 Answering a query in the first (or third) quadrant 

The next step is to compute the column maxima in A. That is, for each pair vr of consecutive 
points in F, we compute the value A^a.^i'^) = maXpAp^, where p ranges over all consecutive 
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pairs in E for which vr is contained in Ip. The fact that A is only partially defined makes this 
task slightly more involved than the similar task for totally defined inverse Monge matrices. 
Intuitively, this computation is similar to the construction of an upper envelope of pseudo- 
segments in the plane. Indeed, we can think of the entries of a particular row p as forming 
the graph of a (discrete) partially defined function Ap{-), mapping indices tt of columns to 
the areas of the corresponding rectangles. Equation ([2]) implies that these functions 
behave as pseudo-segments. Specifically, we extend the domain of definition of each function 
Ap to a segment, delimited by the first and last pairs tt at which Ap^^ is defined, by linearly 
interpolating between each pair of 7r-consecutive points on its graph. Then ([2]) implies that 
each pair of the resulting connected polygonal curves intersect at most once. 

The complexity of the upper envelope of m pseudo-segments is 0{ma{m)) (see [20]). 
More precisely, this expression bounds the number of breakpoints of the envelope (points 
where two distinct graphs intersect on the envelope; for technical reasons we also regard 
the leftmost and rightmost points of each graph as breakpoints), and ignores the complexity 
of the individual functions (the graph of each of our functions consists of many segments, 
one fewer than the number of columns where A is defined at the corresponding row, and 
these individual complexities are ignored in the bound above); this comment is crucial for the 
complexity analysis of our procedure. Since we can find the intersection of any pair of pseudo- 
segments Ap-^ and Ap^ in 0(log ri) time, by a binary search through the relevant columns, 
we can compute this upper envelope in 0(mQ;(m) logmlogn) time, by a simple divide- 
and-conquer algorithm, or in 0(m logmlogn) time, using the more elaborate algorithm of 
Hershberger [14] . 

In accordance with the remark made in the preceding paragraph, we note that in the 
algorithm just sketched we do not attempt to compute and output the upper envelope 
explicitly — this will take Q{n) time at each recursive step, for filling in the value of the 
envelope at every column. Instead, we only compute its breakpoints, which partition the 
columns into 0{ma{m)) blocks, so that for all tt in the same block, maxpy4p^ is attained 
by the same row. This implicit representation is significantly cheaper when m <^ n, and is 
crucial to obtain the running time asserted above. 

Remark. Rather than adapting the divide-and-conquer algorithm for upper envelopes, 
just mentioned above, to our discrete settings, we can use an algorithm of Klawe [15] for 
computing row maxima in staircase (inverse) Monge matrices. Recall that a staircase matrix 
is a partially defined matrix in which the defined portion of each row is contiguous starting 
at the leftmost column, and the defined part of each row is not smaller than the defined 
part of the preceding row. We shall also refer as a staircase matrix to a matrix that can 
be made staircase by inverting the order of the rows and the columns. (Note that the 
operation of inverting the order of the rows and the columns preserves the inverse Monge 
property.) Klawe shows how to find row maxima in a staircase totally monotone matrix in 
0{na{m) + m) time, where m is the number of rows and n is the number of columns. 

We can use Klawe's algorithm as follows; refer to Figure [T31 We split the matrix into 
four submatrices Ai, . . . , A4 at the middle row, where Ai and A2 are formed by the first half 
of the rows and A3, and A4 are formed by the second half. We take the contiguous block of 
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Figure 13: The recursive construction using Klawe's algorithm. 



columns whose defined portions intersect the middle row, split each of these columns at the 
middle row, and form Ai from the top parts of these columns and from from the bottom 
parts. The submatrix A2 (resp., A4) is defined by the rows above (resp., below) the middle 
row and by the columns whose defined portions are fully contained in this range of rows. 
Clearly, Ai and ^3 are two staircase submatrices (one straight and one inverted). 

It follows that we can find column maxima in Ai and A^ by two applications of Klawe's 
algorithm to A\ and A\, and then by taking the maximum of the two relevant outputs for 
each column. (Formally, Klawe's algorithm finds row maxima and we need column maxima, 
but since the transpose operation preserves the inverse Monge property, the application of 
Klawe's algorithm to the transposed matrix yields the desired column maxima.) We then 
recursively apply the algorithm to the submatrices y42, A4. Note that A2 and A^ are disjoint 
submatrices of A, each with half as many rows, and their column ranges are disjoint, and 
also disjoint from the column ranges of Ai and ^3. This is easily seen to imply that the 
running time of this recursive algorithm is 0{ma{n) logm + n). 

Although this algorithm is faster (when n = 0{m)) than the one based on computing 
the upper envelope of pseudo-segments, the latter will be used again later, when we show 
how to handle query points in the second or fourth quadrants. 

By construction, the upper envelope of the pseudo-segments corresponding to the rows 
of A records the column maxima of A. Specifically, we scan the upper envelope from left to 
right, and the maximum for each column occurs at the row that attains the upper envelope 
at that column. (We can afford to perform this scan once, upon termination of the whole 
procedure, but not at each of the recursive steps of constructing sub-envelopes.) 

After computing the column maxima in A we build a range-maxima data structure stor- 
ing these column maxima, so that we can efficiently retrieve the maximum in any query 
contiguous subsequence of the columns. Such a structure can be constructed in time (and 
storage) linear in the number of columns, and a query can be answered in 0(1) time (see [6] 
and the reference therein for the original results). For our purpose, though, since we have to 
search F to identify the interval of columns that the query point q "controls" , we might as 
well use a standard binary search tree over the columns, instead of the more sophisticated 
structure of [0] . We store in each subtree of the tree the maximum of the column maxima, 
over all columns stored at the subtree, which allows us to find the maximum in a query 
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interval of columns in O(logn) time. 

The query point q itself, if it lies in the first quadrant, defines a contiguous subsequence 
Jq of the sequence F of minimal points in the first quadrant, namely, those that lie above q 
and to its right. Only consecutive pairs within this subsequence can form the top and right 
defining points of a maximal empty rectangle containing q of the type considered here. So 
we compute Jg, in logarithmic time, and compute max^ ^max(7r), over all pairs tt contained 
in Jg, using the range-maxima data structure just described, and output the corresponding 
rectangle. 

As described so far we need two search trees, one is the range-maxima data structure 
over the column maxima of A, and the other is a search tree over F which we use to identify 
the subsequence Jq carved out from F by a query point q. Since a column in A corresponds 
to a pair of consecutive points of F we can in fact use only one search tree for both purposes. 
This search tree is over the points in F and it stores in each node v the largest of the column 
maximum of the columns which are associated with pairs of consecutive points in the subtree 
of V. 

A query with a point in the third quadrant is handled in a fully symmetric manner, using 
a symmetric data structure in which the roles of E and F are interchanged. The cases where 
the query is in the second or fourth quadrants will be considered next. 

This structure for queries in the first quadrant uses an additional binary search tree for 
range maxima queries, for each secondary node in T. The total size of these structures is 
0{n\og^ n), and they can all be constructed in O(nlog^n) overall time, using Hershberger's 
algorithm [H] (In each secondary node v we need 0{n^\og n^,) time to compute the upper 
envelope of the functions defined by the rows of the matrix associated with v.) A query 
takes 0(log^ n) time, because we spend logarithmic time at each secondary node of T on 
the search paths, for which q is in the first or third quadrant of B^. 

3.3.2 Answering a query in the second (or fourth) quadrant 

Consider next the case where q is in the second quadrant of (the case where q is in the 
fourth quadrant is handled in a symmetric manner). Consider the prefix Fq of F consisting of 
points whose ^/-coordinate is larger than that of and the prefix Eq of E consisting of points 
whose x-coordinate is smaller than that of q. The rectangles defined by pairs of consecutive 
points in E and in F which contain q are exactly those defined by pairs with at least one 
point in Eq and one point in Fq. See Figure [T4l 

Here is an overview of our approach. We use the same matrix A defined in the preceding 
subsection. We store the rows of A in a balanced binary search tree T^. Each node u of 
Th stores the upper envelope £u of the pseudo-segments corresponding to the rows in the 
subtree of u. Given a query q in the second quadrant, we compute Eq, retrieve the pair pq 
formed by the last point of Eq and the next point of E (if such a point exists; otherwise we 
form the last pair in Eq = E), and represent the first pq rows of A as the disjoint union of 
O(logn^) canonical subsets of rows, corresponding to a collection Nq of 0(log?7,^,) respective 
nodes of T^. See Figure [T^ for a schematic depiction of this structure. 

We next compute Fq and its "last pair" vTg, defined analogously to pq. What we need to 
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Figure 14: Querying with a point in the second quadrant. The highhghted points form the 
prefixes Eg and Fq, plus one extra point in each chain. 



do is to compute max{£^„(7r) \ tv < TVg}, over all nodes u & Ng, and return the largest of these 
values (along with its corresponding rectangle). 

However, we cannot afford to enumerate the values of the envelopes Su for all nodes 
u E Th explicitly, because we may have Q^n^) envelopes, each consisting of 6(nt,) values, 
so we may need quadratic storage for an explicit representation of the envelopes. (This is 
the same problem that we faced in the preceding subsection.) We therefore need an implicit 
representation that would still allow us to compute the maximum of an envelope within a 
query prefix range vr < vr^, in polylogarithmic time. 

To do so, we use the compact representation of an envelope by its breakpoints, as used 
in the preceding case. The divide-and-conquer construction of the upper envelope of the 
entire range of rows of A, described in the preceding subsection, yields as a by-product all 
the upper envelopes Su, over all nodes uoiTh. Each envelope £u is represented as a sequence 
of 0(mua(m„)) intervals of columns, where is the number of rows stored at u (the size 
of the subtree of tt), so that, over each interval, £u is attained by some fixed row. 

We thus face the following subproblem. We are given an upper envelope £ui defined at a 
node u of which spans rows of A, as a sequence of 0(m^ja(m„)) intervals of columns 
delimited by breakpoints of £u (as above some of these breakpoints may be endpoints of 
the domains of definition of some rows). Our goal is to preprocess £u into a data structure 
so that, given a prefix range of columns vr < tt^, we can compute max{£„(7r) | vr < tt^} 
efficiently. In the spirit of the preceding discussion, the preprocessing has to take time that 
is near-linear in m, and cannot afford an explicit enumeration of Instead, we use the 
following approach. 

The compact representation of £u calls for the design of a black-box routine that receives 
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Figure 15: A single node u of the tree Th and a schematic representation of its upper envelope 
S 



as input a row p and an interval [7i"i,'/r2] of columns, and returns maxjAp^ | tti < tt < 7r2}. 
Having such a routine at hand, we first compute, by repeated calls to the black-box routine, 
the maximum value of over each of its 0{mua{mu)) intervals (recalling that within each 
of these intervals Su is attained by a single row), then compute the cumulative maxima, for 
each prefix of this sequence of intervals, and store these prefix maxima in an array. Then, 
given a query index tt^, we find the largest prefix of intervals that fully precede Hq, retrieve 
the cumulative maximum of this prefix, and make one more call to the black-box routine to 
retrieve max{ApT^ \ TTmax + 1 < tt < ttq}, where TTmax is the index of the last column of the 
last complete interval in the prefix, and where p is the row attaining the envelope Su at the 
next interval. We return the maximum of the output of this call and the retrieved prefix 
maximum. 

To implement this black-box routine, we apply a simpler variant of the construction de- 
scribed so far, to the transposed matrix A*. That is, we store the rows of A*' (originally, 
columns of A) in a balanced binary tree T^, and apply a divide- and-conquer procedure for 
computing, for each node w of T^, the upper envelopes S^^ of the pseudo-segments corre- 
sponding to the rows of A* stored at w, again, representing each envelope £^4 ^ sequence 
of 0{muja{muj)) intervals, where rriw is the size of the subtree of w. Now, given a query 
(p, [7ri,7r2]), we search in and obtain a representation of the interval [7ri,7r2] as the dis- 
joint union of 0(logn„) canonical intervals of rows of A* (columns of A), corresponding 
to O(logn^) nodes of T^. For each such node w, we retrieve £i,{p), in Oilogrriu,) time, 
by searching with p through the sequence of intervals representing £l^. The maximum of 
these 0(logn„) values is the desired maximum that we seek. The overall cost of this 

computation is 0{}.o^ n^). 
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To complete the analysis, we next bound the storage, preprocessing cost, and query time 
for the entire structure. 

Both trees Th and Tj^ associated with a secondary node v of the range tree T are of size 
0{n^a{ny) logn^), including all secondary search trees over the upper envelopes, and it takes 
a total of C>(n^a(n^) log^ Uy) time to construct them. (The divide-and-conquer algorithm 
does in fact produce all the sub-envelopes at all the nodes of or of Tj^, at a particular 
secondary node v, within the above time bound.) For each node w G we compute the 
maximum in each interval of £u using Tj^. This takes 0(?7.^a;(n^) log^ n„) time. 

Summing over all secondary nodes v in T, we obtain that the size of the entire range 
tree (including the respective trees Th and and a search tree over each envelope Su (resp., 
£D for each node u in (resp., T^)) is O {na{n) log^ n). The total preprocessing time is 
0{na{n) log^ n) time, because the cost of processing each block of each envelope £u, for nodes 
u of Th, is 0{log^ny), using the black-box routine. 

A query in each secondary node v in which q falls in the second or fourth quadrant of 
By takes O (log^ Uy) time. This running time follows since we need to perform one binary 
search in Th to locate the 0{logny) nodes A''^ representing the prefix Eg. In each such 
node u ^ Nq we perform another binary search in Su to find the longest prefix of intervals 
that fully precede tt^ and retrieve the cumulative maximum of this prefix. Finally in each 
such node u we make one more query to the data structure representing to retrieve 
max{y4p7r I TTmax + l < 71" < TTg}, where TTmax is the index of the last column of the last complete 
interval in the prefix of intervals fully preceding vr^, and where p is the row attaining the 
envelope Ey at the next interval, which contains tt^. This last query takes 0(log Uy) time, 
from which the overall cost of 0{\og^ n„) at v follows. This dominates the logarithmic query 
cost at nodes v where q lies in the first or third quadrant, and, summed over all secondary 
nodes v of T, yields an overall 0(log^ n) query time. 

We recall that the entire presentation caters to maximal P-empty rectangles having 
two defining points in the first quadrant of By and two in the third quadrant. To handle 
rectangles having two defining points in each of the second and fourth quadrants, we prepare 
a second, symmetric version of the structure in which the roles of quadrants are appropriately 
interchanged, and query both structures with q. 

We can reduce the query time and the preprocessing time required at a secondary node 
V hj a. logarithmic factor using fractional cascading [11]. This technique allows us to insert 
bridges between the envelopes corresponding to the nodes w of Tj^ so that once we locate the 
interval covering a particular column p (of A^) in S^^ we could locate the interval containing 
p in the envelope of a node w' adjacent to w in 0(1) time. This allows us to construct 
a data structure over so the maximum in a particular row p of A and a range of columns 
[7ri,vr2] of A can be found in 0{\ogny) time instead of (9(log^n^,) time. This modification 
does not incur any space overhead. 

In summary, we obtain the following main result of the paper. 

Theorem 3.2. The data structure described above requires 0{na{n) log^n) storage, and can 
be constructed in O {na{n) log^ n) time. Using the structure, one can find the largest-area 
P -empty rectangle contained in B and containing a query point q in 0{\og^n) time. 
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4 Submatrix maxima in totally monotone matrices 



Consider a partially defined totally monotone n x n matrix A in which the defined entries 
in each row are consecutive. 

The range minima data structure that we associated with a secondary node in Section 
13.3.11 is in fact a general data structure for preprocessing such a matrix A, in 0(n log n) 
time, so that we can find the maximum of any row within an interval of columns in 0(1) 
time. (To get a constant query time we need one of the more sophisticated range maxima 
data structures mentioned there.) The size of this data structure is linear in n. 

If A is a double staircase matrix (so the defined entries of each column are also consecutive) 
then we showed in Section 13.3.21 how to construct a data structure so that we can find the 
maxima in any submatrix of A defined by a prefix of the rows and a prefix of the columns, 
in O(log^n) time. This data structure takes 0{na{n) logn) space and 0{na{n) log^ra) time 
to construct. 

The latter data structure can be easily extended so that it can find the maxima in any 
contiguous submatrix of A. The bounds remains the same. Since our application does 
not require general submatrix queries, we leave out the details of this extension, which are 
straightforward. Nevertheless, hoping that applications of this extended structure will arise 
in the future, we state the result explicitly: 

Theorem 4.1. Given a double-staircase totaly monotone nxn matrix A, one can preprocess 
it, in 0{na{n)\og n) time, into a data structure of size 0{na{n)\ogn) , so that, given any 
contiguous submatrix B of A, the maximum entry of B can be computed in O(log^n) time. 
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